AITopics | crowd worker

Collaborating Authors

crowd worker

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Pick-a-Pic: An Open Dataset of User Preferences for Text-to-Image Generation

Neural Information Processing SystemsFeb-14-2026, 10:55:35 GMT

The ability to collect a large dataset of human preferences from text-to-image users is usually limited to companies, making such datasets inaccessible to the public. To address this issue, we create a web app that enables text-to-image users to generate images and specify their preferences. Using this web app we build Pick-a-Pic, a large, open dataset of text-to-image prompts and real users'

artificial intelligence, machine learning, pickscore, (18 more...)

Neural Information Processing Systems

Country: Asia > Middle East > Israel > Tel Aviv District > Tel Aviv (0.04)

Industry: Information Technology (0.55)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.85)

Add feedback

d30d0f522a86b3665d8e3a9a91472e28-Supplemental.pdf

Neural Information Processing SystemsFeb-11-2026, 07:30:34 GMT

crowd worker, hypothesis, titan rtx gpus, (14 more...)

Neural Information Processing Systems

Country: North America > United States (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.88)

Add feedback

a96fe863f85c59789bba63588a9557b4-Supplemental-Datasets_and_Benchmarks.pdf

Neural Information Processing SystemsFeb-11-2026, 06:18:39 GMT

artificial intelligence, screenshot, solvable-by-human score, (17 more...)

Neural Information Processing Systems

Country:

Oceania > New Zealand (0.04)
Oceania > Australia (0.04)

Industry: Education > Educational Setting (0.47)

Technology: Information Technology > Artificial Intelligence > Vision (0.70)

Add feedback

92249f9233286e437f808fa535d88b26-Supplemental-Datasets_and_Benchmarks_Track.pdf

Neural Information Processing SystemsOct-10-2025, 09:40:15 GMT

dataset, huggingface, platform, (13 more...)

Neural Information Processing Systems

Industry: Law (0.48)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.49)

Add feedback

On-the-Job Learning with Bayesian Decision Theory

Keenon Werling, Arun Tejasvi Chaganty, Percy S. Liang, Christopher D. Manning

Neural Information Processing SystemsOct-2-2025, 03:30:41 GMT

Our goal is to deploy a high-accuracy system starting with zero training examples. We consider an on-the-job setting, where as inputs arrive, we use real-time crowd-sourcing to resolve uncertainty where needed and output our prediction when confident. As the model improves over time, the reliance on crowdsourcing queries decreases. We cast our setting as a stochastic game based on Bayesian decision theory, which allows us to balance latency, cost, and accuracy objectives in a principled way. Computing the optimal policy is intractable, so we develop an approximation based on Monte Carlo Tree Search. We tested our approach on three datasets--named-entity recognition, sentiment classification, and image classification. On the NER task we obtained more than an order of magnitude reduction in cost compared to full human annotation, while boosting performance relative to the expert provided labels.

classification, learning, query, (12 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Santa Clara County > Palo Alto (0.05)
North America > United States > Wisconsin > Dane County > Madison (0.04)
North America > United States > Oregon > Multnomah County > Portland (0.04)
(2 more...)

Industry: Leisure & Entertainment > Games (0.94)

Technology:

Information Technology > Communications > Social Media > Crowdsourcing (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)

Add feedback

A Proofs

Neural Information Processing SystemsAug-22-2025, 01:13:42 GMT

In this section, we give full proofs of the two main theorems in the paper. 's invertibility and equality 9 follows from Definition 5. Then for By Jensen's inequality, we have: ξ In this section, we give more details of the algorithms we used in the paper. For each i { 1, 2,...,m }, there are n Our code is written with PyTorch. Section 4 and we choose our hyperparameters by the validation performance on the dev sets. The majority of the MNLI corpus is released under the OANC's license, and CMLE method (see Equation 4).

artificial intelligence, hypothesis, machine learning, (16 more...)

Neural Information Processing Systems

Country: North America > United States (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.88)

Add feedback

Redefining Research Crowdsourcing: Incorporating Human Feedback with LLM-Powered Digital Twins

Chan, Amanda, Di, Catherine, Rupertus, Joseph, Smith, Gary, Rao, Varun Nagaraj, Ribeiro, Manoel Horta, Monroy-Hernández, Andrés

arXiv.org Artificial IntelligenceJun-2-2025

Crowd work platforms like Amazon Mechanical Turk and Prolific are vital for research, yet workers' growing use of generative AI tools poses challenges. Researchers face compromised data validity as AI responses replace authentic human behavior, while workers risk diminished roles as AI automates tasks. To address this, we propose a hybrid framework using digital twins, personalized AI models that emulate workers' behaviors and preferences while keeping humans in the loop. We evaluate our system with an experiment (n=88 crowd workers) and in-depth interviews with crowd workers (n=5) and social science researchers (n=4). Our results suggest that digital twins may enhance productivity and reduce decision fatigue while maintaining response quality. Both researchers and workers emphasized the importance of transparency, ethical data use, and worker agency. By automating repetitive tasks and preserving human engagement for nuanced ones, digital twins may help balance scalability with authenticity.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

doi: 10.1145/3706599.3720269

2505.24004

Country: North America > United States > New Jersey (0.15)

Genre:

Research Report > New Finding (1.00)
Questionnaire & Opinion Survey (1.00)

Industry:

Information Technology > Security & Privacy (1.00)
Health & Medicine (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.93)
Information Technology > Communications > Social Media > Crowdsourcing (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.48)

Add feedback

Prevalence and Prevention of Large Language Model Use in Crowd Work

Communications of the ACMFeb-19-2025, 17:15:44 GMT

Probabilistic classify-and-count, where we calibrated the model6 (see Appendix) and then averaged the LLM probabilities (estimate: 35.2% [29.8%, 40.6%]) Corrected classify-and-count, adjusting for the type I and type II error rates estimated on the training data18 (estimate: 35.4% [27.8%, 43.0%]). We validated our results by analyzing crowd workers' copy-pasting behavior (see Appendix), finding that 55% of the summaries where workers had copy-pasted text were classified as synthetic (that is, LLM probability above 50%) vs.

artificial intelligence, large language model, natural language, (11 more...)

Communications of the ACM

Genre: Research Report > New Finding (0.39)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Add feedback

Scaling Public Health Text Annotation: Zero-Shot Learning vs. Crowdsourcing for Improved Efficiency and Labeling Accuracy

Kazari, Kamyar, Chen, Yong, Shakeri, Zahra

arXiv.org Artificial IntelligenceFeb-9-2025

Public health researchers are increasingly interested in using social media data to study health-related behaviors, but manually labeling this data can be labor-intensive and costly. This study explores whether zero-shot labeling using large language models (LLMs) can match or surpass conventional crowd-sourced annotation for Twitter posts related to sleep disorders, physical activity, and sedentary behavior. Multiple annotation pipelines were designed to compare labels produced by domain experts, crowd workers, and LLM-driven approaches under varied prompt-engineering strategies. Our findings indicate that LLMs can rival human performance in straightforward classification tasks and significantly reduce labeling time, yet their accuracy diminishes for tasks requiring more nuanced domain knowledge. These results clarify the trade-offs between automated scalability and human expertise, demonstrating conditions under which LLM-based labeling can be efficiently integrated into public health research without undermining label quality.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2502.0615

Country: North America > Canada > Ontario > Toronto (0.15)

Genre: Research Report > New Finding (0.48)

Industry: